BACKGROUND OF THE INVENTION

[0001] Multiple sclerosis (MS) is a demyelinating disease of the central
nervous system (CNS) whose complete cause still remains unknown.

[0002] Numerous studies have supported the hypothesis of a viral etiology
of the disease, but none of the known viruses tested has proved to be the causative
agent sought: a review of the viruses tested for in MS over many years has been
carried out by E. Norrby and R.T. Johnson.

[0003] Recently, a retrovirus, different from the known human retroviruses, 
was isolated from patients suffering from MS. The authors were able to show that this 
retrovirus could be transmitted in vitro, that patients suffering from MS produced 
antibodies capable of recognizing proteins associated with the infection of the
leptomeningeal cells by this retrovirus, and that the expression of the latter could be
greatly stimulated by the immediate-early genes of some herpesviruses. 

[0004] All these results argue in favor of the role in MS of at least one 
unknown retrovirus or of a virus having a reverse transcriptase (RT) activity which is 
detectable by the method published by H. Perron and termed "LM7-type RT" activity.

[0005] The studies by the applicant have made it possible to obtain two
continuous cell lines infected with natural isolates obtained from two different
patients suffering from MS, by a culture method as described in the document
WO-A-93 20188, whose content is incorporated by reference into the present
description. These two lines derived from cells of human choroid plexus, called
LM7PC and PLI-2, were deposited at the E.C.A.C.C. on 22 July 1992 and 8 January
1993, respectively, under numbers 92 072201 and 93 010817, in accordance with the
provisions of the Treaty of Budapest. Moreover, the viral isolates possessing an
LM7-type RT activity have also been deposited at the E.C.A.C.C. under the overall
name of "strains". The "strain" or isolate harbored by the PLI-2 line, called POL-2,
was deposited at the E.C.A.C.C. on 22 July 1992 under No. V92072202. The "strain"
or isolate harbored by the LM7PC line, called MS7PG, was deposited at the
E.C.A.C.C. on 8 January 1993 under No. V93010816.

[0006] Using the above-mentioned cultures and isolates, characterized by
biological and morphological criteria, efforts were then made to characterize the
genetic material associated with the viral particles produced in these cultures.

[0007] The portions of the genome already characterized were used to
develop molecular detection tests for the viral genome and immunoserological tests,
using the amino acid sequences encoded by the nucleotide sequences of the viral
genome, in order to detect the immune response directed against epitopes associated
with the viral infection and/or expression.

[0008] These tools have already made it possible to confirm an association
between MS and the expression of the sequences identified in the patents cited further
on. However, the viral system discovered by the applicant is related to a complex
retroviral system. Indeed, the sequences which are found to be encapsidated in the
extracellular viral particles produced by the different cultures of cells of patients
suffering from MS show clearly that there is co-encapsidation of retroviral genomes
which are related to but different from the "wild-type" retroviral genome which produces
the infectious viral particles. This phenomenon was observed between replicative
retroviruses and endogenous retroviruses belonging to the same family, or even
heterologous retroviruses. The concept of endogenous retrovirus is very important in
the context of our discovery because, in the case of MSRV-1, it has been observed
that endogenous retroviral sequences comprising sequences homologous to the
MSRV-1 genome exist in normal human DNA. The existence of endogenous
retroviral elements (ERV) related to MSRV-1 through all or part of their genome
explains the fact that the expression of the MSRV-1 retrovirus in human cells can
interact with related endogenous sequences. These interactions are found in the case
of pathogenic and/or infectious endogenous retroviruses (for example some ecotropic
strains of the Murine Leukemia virus), and in the case of exogenous retroviruses whose
nucleotide sequence may be found partially or completely in the form of ERVs in the
genome of the host animal (e.g. mouse mammary tumor exogenous virus transmitted
via milk). These interactions consist mainly of (i) a transactivation or co-activation of
ERVs by the replicative retrovirus, (ii) an "illegitimate" encapsidation of related
RNAs of ERVs, or of ERVs - or even of cellular RNAs - simply possessing
compatible encapsidation sequences, into the retroviral particles produced by the
expression of the replicative strain, which are sometimes transmissible and sometimes
have an inherent pathogenicity, and (iii) relatively frequent recombinations between the
co-encapsidated genomes, in particular in the reverse transcription phases, which lead
to the formation of hybrid genomes, which are sometimes transmissible and
sometimes have an inherent pathogenicity.

[0009] Thus, (i) various MSRV-1-related sequences have been found in
purified viral particles; (ii) molecular analysis of the various regions of the MSRV-1
retroviral genome should be carried out by systematically analyzing the
co-encapsidated, interfering and/or recombinant sequences which are generated by the
infection and/or expression of MSRV-1; furthermore, some clones may have portions
of defective sequences produced by the retroviral replication and the template and/or
transcription errors caused by reverse transcriptase; (iii) the families of sequences
related to the same retroviral genomic region are the supports for an overall diagnostic
detection which may be optimized by the identification of invariable regions among
the clones expressed and by the identification of reading frames responsible for the
production of antigenic and/or pathogenic polypeptides which may only be produced
by a portion, or even only one, of the clones expressed; under these conditions, the
systematic analysis of the clones expressed in one region of a given gene makes it
possible to evaluate the frequency of variation and/or recombination of the MSRV-1
genome in this region and to define the optimum sequences for the applications, in
particular the diagnostic applications; (iv) the pathology caused by a retrovirus such as
MSRV-1 may be a direct effect of its expression and of the proteins or peptides
produced as a result, but also an effect of the activation, encapsidation or recombination
of related or heterologous genomes and of the proteins or peptides produced as a result; thus,
these genomes associated with the expression of and/or infection by MSRV-1 are an
integral part of the potential pathogenicity of this virus and therefore constitute
diagnostic detection supports and particular therapeutic targets. Likewise, any agent
which is associated with, or which is a cofactor for, these interactions responsible for
the pathogenicity in question, such as MSRV-2 or the gliotoxic factor described in the
patent application published under the No. FR-2,716,198, can participate in the
development of an overall and very effective strategy for therapeutic diagnosis,
prognosis, monitoring and/or integrated therapy for MS in particular, but also for any
other disease associated with the same agents.

[0010] In this context, a parallel discovery has been made in another
autoimmune disease, rheumatoid arthritis (RA), which has been described in the
French patent application published under the No. FR-2,731,356. This discovery
shows that, by applying methodological approaches similar to those which were used
in the studies by the applicant on MS, it has been possible to identify a retrovirus
expressed in RA which shares the sequences described for MSRV-1 in MS and also
the coexistence of an MSRV-2-associated sequence which is also described in MS.
As regards MSRV-1, the sequences commonly detected in MS and RA relate to the
pol and gag genes. On the basis of current knowledge, it is possible to combine the
gag and pol sequences described with the MSRV-1 strains expressed in these two
diseases.

[0011] The present patent application has as its object various results, 
supplementary in relation to those already protected by the French patent applications: 

No. 92/04322 of 03.04.1992, published under No. 2,689,519;

No. 92/13447 of 03.11.1992, published under No. 2,689,521; 

No. 92/13443 of 03.11.1992, published under No. 2,689,520;

No. 94/01529 of 04.02.1994, published under No. 2,715,936; 

No. 94/01531 of 04.02.1994, published under No. 2,715,939;

No. 94/01530 of 04.02.1994, published under No. 2,715,938; 

No. 94/01532 of 04.02.1994, published under No. 2,715,937; 

No. 94/14322 of 24.11.1994, published under No. 2,727,428; 

No. 94/15810 of 23.12.1994, published under No. 2,728,585; and 

Patent Application WO 97/06260. 

SUMMARY OF THE INVENTION 

[0012] The present invention relates, first of all, to a nucleic material, which 
may consist of a retroviral material, in isolated or purified state, which may be 
understood or characterized in various ways: 

- it comprises a nucleotide sequence chosen from the group which consists of 
(i) the sequences SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 12, 
SEQ ID NO: 16, SEQ ID NO: 21, SEQ ID NO: 30 and SEQ ID NO: 31; (ii) the
sequences complementary to sequences (i); and (iii) the sequences equivalent to 
sequences (i) or (ii), in particular the sequences having, for every series of 100 
contiguous monomers, at least 50%, and preferentially at least 70% homology with 
sequences (i) or (ii) respectively; 

- it encodes a polypeptide having, for every contiguous series of at least 30
amino acids, at least 50%, and preferably at least 70% homology with a peptide
sequence chosen from the group which consists of SEQ ID NO: 5, SEQ ID NO: 7,
SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 25 and SEQ ID NO: 26;

- its pol gene comprises a nucleotide sequence identical or equivalent to a 
sequence chosen from the group which consists of SEQ ID NO: 4, SEQ ID NO: 16
and their complementary sequences;

- the 5' end of its pol gene starts at nucleotide 1419 of SEQ ID NO: 21;

- its pol gene encodes a polypeptide having, for every contiguous series of at 
least 30 amino acids, at least 50%, and preferably at least 70% homology with the 
peptide sequence SEQ ID NO: 5; 

- the 3' end of its gag gene ends at nucleotide 1418 of SEQ ID NO: 21;

- its env gene comprises a nucleotide sequence identical or equivalent to a 
sequence chosen from the group which consists of SEQ ID NO: 9, and its 
complementary sequences; 

- its env gene comprises a nucleotide sequence which starts at nucleotide 1 of
SEQ ID NO: 9 and ends at nucleotide 233 of SEQ ID NO: 6;

- its env gene encodes a polypeptide having, for every contiguous series of at 
least 30 amino acids, at least 50%, and preferably at least 70% homology with the 
sequence SEQ ID NO: 10; 

- the U3R region of its 3' LTR comprises a nucleotide sequence which ends at
nucleotide 617 of SEQ ID NO: 6;

- the RU5 region of its 5' LTR comprises a nucleotide sequence which starts at
nucleotide 755 of SEQ ID NO: 12 and ends at nucleotide 337 of SEQ ID NO: 30 or
SEQ ID NO: 31;

- a retroviral nucleic material comprising a sequence which starts at
nucleotide 755 of SEQ ID NO: 12 and which ends at nucleotide 617 of SEQ ID
NO: 6;

- the retroviral nucleic material as defined above is in particular associated
with at least one autoimmune disease such as multiple sclerosis or rheumatoid
arthritis.

[0013] The invention also relates to a nucleotide fragment which corresponds
to at least one of the following definitions:

- it comprises or consists of a nucleotide sequence chosen from the group
which consists of (i) the sequences SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9,
SEQ ID NO: 12, SEQ ID NO: 16, SEQ ID NO: 21, SEQ ID NO: 30 and SEQ ID
NO: 31; (ii) the sequences complementary to sequences (i); and (iii) the sequences
equivalent to sequences (i) or (ii), in particular the sequences having, for every series
of 100 contiguous monomers, at least 50%, and preferentially at least 70% homology
with sequences (i) or (ii) respectively;

- it comprises or consists of a nucleotide sequence encoding a polypeptide
having, for every contiguous series of at least 30 amino acids, at least 50%, and
preferably at least 70% homology with a peptide sequence chosen from the group
which consists of SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13,
SEQ ID NO: 25 and SEQ ID NO: 26.

[0014] Other subjects of the present invention are the following:

- a nucleic probe for the detection of a retrovirus associated with multiple
sclerosis and/or rheumatoid arthritis, capable of hybridizing specifically with any
fragment defined above and belonging to the genome of said retrovirus; it
advantageously possesses from 10 to 100 nucleotides, preferably from 10 to 30
nucleotides;

- a primer for the amplification, by polymerization, of an RNA or of a DNA of
a retrovirus associated with multiple sclerosis and/or rheumatoid arthritis, which
comprises a nucleotide sequence identical or equivalent to at least a portion of the
nucleotide sequence of a fragment defined above, in particular a nucleotide sequence
having, for every series of 10 contiguous monomers, at least 50%, preferably at least
70% homology with at least said portion of said fragment; preferably the nucleotide
sequence of a primer of the invention is chosen from SEQ ID NO: 8, SEQ ID NO: 11,
SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 23, and SEQ ID NO: 24;

- an RNA or a DNA and in particular a replication and/or expression vector,
comprising a genomic fragment of the nucleic material or a fragment defined above;

- a peptide encoded by any open reading frame belonging to a nucleotide
fragment defined above, in particular a polypeptide, for example an oligopeptide, forming
or comprising an antigenic determinant recognized by sera of patients infected with
the MSRV-1 virus, and/or in whom the MSRV-1 virus has been reactivated; a
preferential peptide comprises a sequence identical, partially or completely, or
equivalent to a sequence chosen from SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID
NO: 10, SEQ ID NO: 13, SEQ ID NO: 25 and SEQ ID NO: 26;

- a diagnostic, prophylactic or therapeutic composition, in particular for
inhibiting the expression of at least one retrovirus associated with multiple sclerosis
and/or rheumatoid arthritis, comprising a nucleotide fragment defined above;

- a method for detecting a retrovirus associated with multiple sclerosis and/or
rheumatoid arthritis, in a biological sample, comprising the steps consisting of
bringing an RNA and/or a DNA assumed to belong to or obtained from said
retrovirus, or their complementary RNA and/or DNA, into contact with a composition
comprising a nucleotide fragment defined above.

DEFINITIONS 

[0015] Before detailing the invention, various terms used in the description
and the claims are now defined.

[0016] Strain or isolate is understood to mean any infectious and/or
pathogenic biological fraction containing, for example, viruses and/or bacteria and/or
parasites, generating a pathogenic and/or antigenic power, harbored by a culture or a
live host; by way of example, a viral strain according to the preceding definition may
contain a co-infectious agent, for example a pathogenic protist.

[0017] The term "MSRV" used in the present description designates any
pathogenic and/or infectious agent, as associated with MS, in particular a viral
species, the attenuated strains of said viral species, or the interfering defective
particles or particles containing co-encapsidated genomes or alternatively genomes
recombined with a portion of the MSRV-1 genome, which are derived from this
species. It is known that viruses and particularly viruses containing RNA exhibit
variability, following in particular relatively high rates of spontaneous mutation,
which will be taken into account below to define the concept of equivalence.

[0018] Human virus is understood to mean a virus capable of infecting or of
being harbored by human beings.

[0019] Given all the natural or induced variations and/or recombinations
which may be encountered in practice in the present invention, the objects thereof,
defined above and in the claims, have been expressed so as to comprise the equivalents
or derivatives of the various biological materials defined below, in particular
homologous nucleotide or peptide sequences.

[0020] The variant of a virus or of a pathogenic and/or infectious agent
according to the invention comprises at least one antigen recognized by at least one
antibody directed against at least one corresponding antigen of said virus and/or of
said pathogenic and/or infectious agent, and/or a genome in which any portion is
detected by at least one hybridization probe, and/or at least one nucleotide
amplification primer specific for said virus and/or pathogenic and/or infectious agent,
under defined hybridization conditions well known to persons skilled in the art.

[0021] According to the invention, a nucleotide fragment or an
oligonucleotide or a polynucleotide is a stretch of monomers, or a biopolymer,
characterized by the informational sequence of the natural nucleic acids, which is
capable of hybridizing to any other nucleotide fragment under predefined conditions,
it being possible for the stretch to contain monomers of different chemical structures
and to be obtained from a natural nucleic acid molecule and/or by genetic
recombination and/or by chemical synthesis; a nucleotide fragment may be identical to
a genomic fragment of the MSRV-1 virus considered by the present invention, in
particular a gene of the latter, for example pol or env in the case of said virus.

[0022] Thus, a monomer may be a natural nucleic acid nucleotide in which
the constituent components are a sugar, a phosphate group and a nitrogenous base; in
RNA, the sugar is ribose; in DNA, the sugar is 2-deoxyribose; depending on whether
DNA or RNA is involved, the nitrogenous base is chosen from adenine, guanine, uracil,
cytosine, thymine; or the nucleotide may be modified in at least one of the three
constituent components; by way of example, the modification may occur at the level
of the bases, generating modified bases such as inosine, 5-methyl-deoxycytidine,
deoxyuridine, 5-dimethylaminodeoxyuridine, 2,6-diaminopurine,
5-bromodeoxyuridine and any other modified base promoting hybridization; at the
level of the sugar, the modification may consist in the replacement of at least one
deoxyribose with a polyamide, and at the level of the phosphate group, the
modification may consist in its replacement with esters, in particular chosen from the
esters of diphosphate, of alkyl and arylphosphonate and of phosphorothioate.

[0023] "Informational sequence" is understood to mean any ordered series of
monomers in which the chemical nature of the monomers and their order in a
reference direction constitute, or otherwise, functional information of the same
quality as that of the natural nucleic acids.

[0024] Hybridization is understood to mean the process during which, under
appropriate operating conditions, two nucleotide fragments, having sufficiently
complementary sequences, become annealed to form a complex, in particular a double
or triple structure, preferably in helical form.
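
By way of illustration only, the notion of "sufficiently complementary sequences" can be sketched in code. The following minimal Python sketch assumes unmodified bases, Watson-Crick pairing only, and two fragments of equal length with no gaps; the function names are hypothetical and do not come from the description, and real hybridization also depends on operating conditions that are not modeled here.

```python
# Watson-Crick pairing for DNA; RNA input is handled by mapping U to T first.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a sequence, read 5' to 3'."""
    dna = seq.upper().replace("U", "T")
    return "".join(COMPLEMENT[base] for base in reversed(dna))


def complementarity(a: str, b: str) -> float:
    """Fraction of positions at which `a` pairs with `b` read antiparallel.

    Assumption of this sketch: both fragments have the same length and
    are compared without gaps.
    """
    if len(a) != len(b) or not a:
        raise ValueError("fragments must be non-empty and of equal length")
    target = reverse_complement(b)
    a_dna = a.upper().replace("U", "T")
    matches = sum(x == y for x, y in zip(a_dna, target))
    return matches / len(a)
```

A value of 1.0 corresponds to two perfectly complementary fragments; the threshold at which two fragments are treated as "sufficiently" complementary is left to the user of the sketch.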

[0025] A probe comprises a nucleotide fragment synthesized by the chemical
route or obtained by digestion or enzymatic cleavage of a longer nucleotide fragment,
comprising at least six monomers, advantageously from 10 to 100 monomers,
preferably 10 to 30 monomers, and possessing a hybridization specificity under
defined conditions; preferably, a probe possessing less than 10 monomers is not used
alone, but is used in the presence of other probes which are equally short in length or
otherwise; under certain specific conditions, it may be useful to use probes which are
greater than 100 monomers in size; a probe may be used in particular for diagnostic
purposes, and it may be, for example, capture and/or detection probes.

[0026] The capture probe may be immobilized on a solid support by any
appropriate means, that is to say directly or indirectly, for example by covalent 
bonding or passive adsorption. 

[0027] The detection probe may be labeled by means of a marker chosen in
particular from radioactive isotopes, enzymes chosen in particular from peroxidase
and alkaline phosphatase and those capable of hydrolyzing a chromogenic, fluorigenic
or luminescent substrate, chromophoric chemical compounds, chromogenic,
fluorigenic or luminescent compounds, analogs of nucleotide bases, and biotin.

[0028] The probes used for diagnostic purposes of the invention may be used
in all known hybridization techniques, and in particular the so-called "DOT-BLOT"
technique, the "SOUTHERN BLOT" technique, the "NORTHERN BLOT" technique, which
is a technique identical to the "SOUTHERN BLOT" technique but which uses RNA
as target, and the SANDWICH technique; advantageously, the SANDWICH technique is
used in the present invention, comprising a specific capture probe and/or a specific
detection probe, it being understood that the capture probe and the detection probe
must have a nucleotide sequence which is at least partially different.

[0029] Any probe according to the present invention may hybridize in vivo
or in vitro with the RNA and/or with the DNA, in order to block the replication, in
particular translation and/or transcription, phenomena and/or to degrade said DNA
and/or RNA.

[0030] A primer is a probe comprising at least six monomers, and
advantageously from 10 to 30 monomers, possessing hybridization specificity under
defined conditions, for the initiation of an enzymatic polymerization, for example in
an amplification technique such as PCR (Polymerase Chain Reaction), in an extension
method such as sequencing, in a reverse transcription method and the like.

[0031] Two nucleotide or peptide sequences are said to be equivalent or
derived with respect to each other, or with respect to a reference sequence, if
functionally the corresponding biopolymers can play substantially the same role,
without being identical, in relation to the application or use considered, or in the
technique in which they are involved; particularly equivalent are two sequences
obtained because of the natural variability, in particular spontaneous mutation, of the
species from which they were identified, or induced mutation, as well as two
homologous sequences, the homology being defined below.

[0032] "Variability" is understood to mean any spontaneous or induced
modification of a sequence, in particular by substitution, and/or insertion, and/or
deletion of nucleotides and/or of nucleotide fragments, and/or extension and/or
shortening of the sequence at least at one of the ends; a nonnatural variability may
result from the genetic engineering techniques used, for example from the choice of
the degenerate or nondegenerate synthetic primers selected to amplify a nucleic acid;
this variability may result in modifications of any starting sequence, considered as a
reference, and which may be expressed by a degree of homology with respect to said
reference sequence.

[0033] Homology characterizes the degree of identity of two compared 
nucleotide or peptide fragments; it is measured by the percentage identity which is in 
particular determined by direct comparison of nucleotide or peptide sequences, with 
respect to reference nucleotide or peptide sequences. 
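
As an illustration of the definition above, and of the criterion used elsewhere in this description ("for every series of 100 contiguous monomers, at least 50% ... homology"), the percentage identity can be computed over a sliding window. The following Python sketch is a simplified reading of that criterion, not a prescribed method: it assumes that the two sequences have already been aligned, have equal length and contain no gaps, and its function names are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two pre-aligned sequences agree."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    same = sum(x == y for x, y in zip(a, b))
    return 100.0 * same / len(a)


def min_window_identity(a: str, b: str, window: int = 100) -> float:
    """Lowest percent identity over every run of `window` contiguous positions."""
    if len(a) != len(b):
        raise ValueError("sequences must be of equal length")
    if window <= 0 or len(a) < window:
        raise ValueError("window must be positive and no longer than the sequences")
    flags = [x == y for x, y in zip(a, b)]
    count = sum(flags[:window])      # matches in the first window
    lowest = count
    for i in range(window, len(flags)):
        # Slide the window by one position: add the new flag, drop the old one.
        count += flags[i] - flags[i - window]
        lowest = min(lowest, count)
    return 100.0 * lowest / window


def is_equivalent(a: str, b: str, window: int = 100, threshold: float = 50.0) -> bool:
    """True if every window of `window` positions reaches `threshold` percent identity."""
    return min_window_identity(a, b, window) >= threshold
```

With `window=100` and `threshold=50.0` the check follows the nucleotide criterion; `window=30` with the same thresholds follows the wording used for amino acid sequences, under the same alignment assumption.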

[0034] Any nucleotide fragment is said to be equivalent to or derived from a
reference fragment if it has a nucleotide sequence equivalent to the sequence of the
reference fragment; according to the preceding definition, in particular equivalent to a
reference nucleotide fragment are:

(a) any fragment capable of hybridizing, at least partially, with the
complement of the reference fragment,

(b) any fragment whose alignment with the reference fragment leads to 
the identification of identical contiguous bases, in a greater number than with any 
other fragment obtained from another taxonomic group, 

(c) any fragment resulting or capable of resulting from the natural
variability of the species from which it is obtained,

(d) any fragment which may result from genetic engineering techniques 
applied to the reference fragment, 

(e) any fragment, containing at least eight contiguous nucleotides,
encoding a peptide homologous or identical to the peptide encoded by the reference
fragment,

(f) any fragment different from the reference fragment through insertion,
deletion, substitution of at least one monomer, extension, or shortening at least at one
of its ends; for example, any fragment corresponding to the reference fragment,
flanked at least at one of its ends by a nucleotide sequence not encoding a polypeptide.

[0035] Polypeptide is understood to mean in particular any peptide of at least
two amino acids, in particular oligopeptide, protein, extracted, separated, or
substantially isolated or synthesized, through the involvement of humans, in particular
those obtained by chemical synthesis, or through expression in a recombinant
organism.

[0036] Polypeptide partially encoded by a nucleotide fragment is understood 
to mean a polypeptide having at least three amino acids encoded by at least nine 
contiguous monomers included in said nucleotide fragment. 
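
The relationship between nine contiguous monomers and three amino acids can be illustrated by translating a fragment in its three forward reading frames. The following Python sketch assumes the standard genetic code and considers only the forward strand; the function names are hypothetical and are not taken from the description.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str, frame: int = 0) -> str:
    """Translate a sequence from offset `frame` (0, 1 or 2); unknown codons give 'X'."""
    dna = seq.upper().replace("U", "T")
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(seq: str) -> list[str]:
    """Translations of the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


def partially_encodes(fragment: str, peptide: str, min_residues: int = 3) -> bool:
    """True if at least `min_residues` consecutive residues of `peptide` are encoded
    by contiguous monomers of `fragment` in one of its forward frames."""
    for protein in three_frames(fragment):
        for i in range(len(peptide) - min_residues + 1):
            if peptide[i:i + min_residues] in protein:
                return True
    return False
```

With the default `min_residues=3`, a match requires nine contiguous monomers of the fragment, which mirrors the definition given above.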

[0037] An amino acid is said to be analogous to another amino acid when
their respective physicochemical characteristics, such as polarity, hydrophobicity
and/or basicity, and/or acidity, and/or neutrality, are substantially the same; thus, a
leucine is analogous to an isoleucine.
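
As a rough illustration of this definition, amino acids can be grouped into coarse physicochemical classes and two residues treated as analogous when they fall in the same class. The grouping below is an assumption made for this sketch (textbooks differ on several residues) and is not taken from the description; the function names are likewise hypothetical.

```python
# Illustrative grouping only: one common coarse classification of the 20 amino acids.
CLASSES = {
    "nonpolar": "GAVLIMFWP",
    "polar": "STCYNQ",
    "basic": "KRH",
    "acidic": "DE",
}
CLASS_OF = {aa: name for name, members in CLASSES.items() for aa in members}


def is_analogous(a: str, b: str) -> bool:
    """True if both one-letter residues fall in the same physicochemical class."""
    return CLASS_OF[a.upper()] == CLASS_OF[b.upper()]


def differs_only_by_analogs(p: str, q: str) -> bool:
    """True if two peptides of equal length differ only by analogous substitutions."""
    if len(p) != len(q):
        return False
    return all(is_analogous(x, y) for x, y in zip(p, q))
```

Under this grouping, leucine (L) and isoleucine (I) are analogous, in line with the example given above, whereas lysine (K) and aspartic acid (D) are not.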

[0038] Any polypeptide is said to be equivalent to or derived from a
reference polypeptide if the polypeptides compared have substantially the same
properties, and in particular the same antigenic, immunological, enzymatic and/or
molecular recognition properties; in particular equivalent to a reference polypeptide is:

(a) any polypeptide possessing a sequence in which at least one amino
acid has been replaced by an analogous amino acid,

(b) any polypeptide having an equivalent peptide sequence, obtained by
natural or induced variation of said reference polypeptide, and/or of the nucleotide
fragment encoding said polypeptide,

(c) a mimotope of said reference polypeptide,

(d) any polypeptide in whose sequence one or more amino acids of the
L series are replaced by an amino acid of the D series, and vice versa,

(e) any polypeptide into whose sequence a modification of the side 
chains of the amino acids has been introduced, such as for example an acetylation of 
the amine-containing functions, a carboxylation of the thiol functions, an esterification 
of the carboxyl functions, 

(f) any polypeptide in whose sequence one or more peptide bonds have 
been modified, such as for example the carba, retro, inverso, retro-inverso, reduced, 
and methylene-oxy bonds, 

(g) any polypeptide in which at least one antigen is recognized by an 
antibody directed against a reference polypeptide. 

[0039] The percentage identity characterizing the homology between two 
peptide fragments compared is according to the present invention at least 50% and 
preferably at least 70%. 

[0040] Given that a virus possessing a reverse transcriptase enzymatic
activity may be genetically characterized both in RNA and DNA form, both the viral
DNA and RNA will be mentioned in order to characterize the sequences relative to a
virus possessing such a reverse transcriptase activity, termed MSRV-1 according to
the present description.

[0041] The expressions of order which are used in the present description 
and the claims, such as "first nucleotide sequence", are not selected to express a 
particular order, but to define the invention more clearly. 

[0042] Detection of a substance or agent is understood below to mean an 
identification, a quantification or a separation or isolation of said substance or of said 
agent. 

BRIEF DESCRIPTION OF THE DRAWINGS

[0043] The invention will be understood more clearly on reading the detailed
description which follows, which is made with reference to the appended Figures.

[0044] Figure 1 represents the general structure of the proviral DNA and the
genomic RNA of MSRV-1.

[0045] Figure 2 represents the nucleotide sequence of the clone called CL6-5'
(SEQ ID NO: 4) and three potential reading frames in amino acids presented under the
nucleotide sequence.

[0046] Figure 3 represents the nucleotide sequence of the clone called CL6-3'
(SEQ ID NO: 6) and three potential reading frames in amino acids presented under the
nucleotide sequence.

[0047] Figure 4 represents the nucleotide sequence of the clone called C15
(SEQ ID NO: 9) and three potential reading frames in amino acids presented under the
nucleotide sequence.

[0048] Figure 5 represents the nucleotide sequence of the clone called 5M6 
(SEQ ID NO: 12) and three potential reading frames in amino acids presented under 
the nucleotide sequence. 

[0049] Figure 6 represents the nucleotide sequence of the clone called CL2
(SEQ ID NO: 21) and three potential reading frames in amino acids presented under
the nucleotide sequence.

[0050] Figure 7 represents a potential reading frame in amino acids
expressed by pET28C-clone 2 (SEQ ID NO: 40) and presented under the nucleotide
sequence.

[0051] Figure 8 represents a potential reading frame in amino acids 
expressed by pET21C-clone 2 (SEQ ID NO: 41) and presented under the nucleotide 
sequence. 

[0052] Figure 9 represents the nucleotide sequence of the clone called LB13
(SEQ ID NO: 30) and three potential reading frames in amino acids presented under
the nucleotide sequence.

[0053] Figure 10 represents the nucleotide sequence of the clone called
LA15 (SEQ ID NO: 31) and three potential reading frames in amino acids presented
under the nucleotide sequence.

[0054] Figure 11 represents the nucleotide sequence of the clone called LB16
(SEQ ID NO: 16) and three potential reading frames in amino acids presented under
the nucleotide sequence.

[0055] Figure 12 represents the promoter activity expressed in cpm/4 min of
the U3R sequences subcloned from LTRs of different origins into the plasmid
PCAT3. PCAT3 means plasmid alone, PCAT-PH74 means plasmid plus endogenous
U3R clone expressed in the placenta, PCATcl6 means plasmid plus U3R clone
amplified in the RNA of an MS plasma, PCAT-5M6 means plasmid plus U3R region
amplified in the cellular DNA, "no plasmid" means absence of plasmid in the test.

[0056] Figure 13 represents the MSRV-1 env and 3' LTR sequences (which
together form SEQ ID NO: 42). The horizontal arrows indicate the start of the env,
U3 and R regions. In the env region, the signal peptide and the potential
immunosuppressive region are underlined, the potential glycosylation sites are boxed
and the potential cleavage sites are indicated by vertical arrows. In the U3R region,
the regulatory element CART and the TATA box are underlined; the "cap" site and
the polyadenylation signal are also indicated.

[0057] Figure 14 represents the 5' LTR (RU5) region followed by a PBS site
(primer binding site) complementary to the Trp tRNA and by a gag gene encoding a
protein of about 487 amino acids (which together form SEQ ID NO: 43). The amino
acids conserved in the nucleocapsid are underlined twice. The amino acids defining
the region of greatest homology in the capsid are in bold and underlined once. The /
symbols in the amino acid sequence indicate variations observed depending on the
clones and, in the nucleotide sequence, they indicate frame jumps in some clones.
The boxed regions correspond to epitopes identified by peptide analysis of the
C-terminal region.

[0058] Figure 15 represents the integrase region of MSRV-1 (SEQ ID
NO: 44), the nucleotide sequence and the amino acid sequence deduced from the
integrase region corresponding to clone 87-23. In Figure 15, // means a frame jump
which has been suppressed in order to restore the potential ORF. The letters in
underlined bold characters represent the conserved amino acids in the retroviral
integrases.

[0059] Figure 16 describes the nucleotide and peptide sequences of clone
B13 (identical to clone FBd13 described in previous applications) with indication of
the ORFs and stop codons represented by a dot (SEQ ID NO: 45). The underlined
region in bold represents the potential immunosuppressive domain. The single
underlined domain represents the start of the 3' LTR.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
[0060] EXAMPLE 1: PREPARATION OF A CL6-5' REGION
ENCODING THE N-TERMINAL END OF INTEGRASE AND OF A CL6-3'
REGION CONTAINING THE 3' TERMINAL SEQUENCE OF THE MSRV-1
GENOME

[0061] A 3' RACE was carried out on the total RNA extracted from plasma
from a patient suffering from MS. A healthy control plasma, treated under the same
conditions, was used as negative control. The synthesis of cDNA was carried out with
an oligo dT primer identified by SEQ ID NO: 1 (5' GAC TCG CTG CAG ATC GAT
TTT TTT TTT TTT TTT T 3') and the reverse transcriptase "Expand™ RT" from
Boehringer according to the conditions recommended by the company. A PCR was
carried out with the enzyme Klentaq (Clontech) under the following conditions: 94°C
5 min then 93°C 1 min, 55°C 1 min, 68°C 3 min over 40 cycles and 68°C for 8 min,
with a final reaction volume of 50 µl.
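
As a rough illustration of the cycling program just described, the following Python sketch adds up the programmed hold times. The function and variable names are hypothetical, and ramping time between temperatures is ignored, so the figure is only a lower bound on the actual run time.

```python
# Hold times (in minutes) taken from the cycling conditions in the text.
# Temperatures are omitted because they do not affect the total duration.
INITIAL_DENATURATION_MIN = 5
CYCLE_STEPS_MIN = [1, 1, 3]  # denaturation, annealing, extension
CYCLES = 40
FINAL_EXTENSION_MIN = 8


def programmed_minutes(initial, steps, cycles, final):
    """Sum of programmed hold times, ignoring ramping between temperatures."""
    return initial + cycles * sum(steps) + final


total = programmed_minutes(
    INITIAL_DENATURATION_MIN, CYCLE_STEPS_MIN, CYCLES, FINAL_EXTENSION_MIN
)
print(f"Programmed time: {total} min")
```

The same helper can be reused for the other cycling programs in these examples by changing the four arguments.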

[0062] Primers used for the PCR:

- 5' primer, identified by SEQ ID NO: 2

5' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3';

- 3' primer, identified by SEQ ID NO: 1

[0063] A second so-called "seminested" PCR was carried out with a 5'
primer situated inside the region already amplified. This second PCR was carried out
under the same experimental conditions as those used for the first PCR, using 10 µl of
the amplification product derived from the first PCR.

[0064] Primers used for the seminested PCR:

- 5' primer, identified by SEQ ID NO: 3

5' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3';

- 3' primer, identified by SEQ ID NO: 1

[0065] The primers SEQ ID NO: 2 and SEQ ID NO: 3 are specific for the
pol region of MSRV-1.

[0066] An amplification product of 1.9 kb was obtained for the plasma of
the MS patient. The corresponding fragment was not observed for the healthy control
plasma. This amplification product was cloned in the following manner:

[0067] The amplified DNA was inserted into a plasmid with the aid of the
TA Cloning kit®. The 2 µl of DNA solution were mixed with 5 µl of sterile distilled
water, 1 µl of a 10 times concentrated ligation buffer "10X LIGATION BUFFER",
2 µl of "pCR™ VECTOR" (25 ng/ml) and 1 µl of "T4 DNA LIGASE". This mixture
was incubated overnight at 12°C. The next steps were carried out in accordance with
the instructions for the TA Cloning kit® (Invitrogen). At the end of the procedure, the
white colonies of recombinant bacteria were subcultured so as to be cultured
and allow the extraction of the plasmids incorporated according to the so-called
"miniprep" procedure. The plasmid preparation of each recombinant colony was cut
with an appropriate restriction enzyme and analyzed on agarose gel. The plasmids
possessing an insert detected under UV light after staining the gel with ethidium
bromide were selected for the sequencing of the insert after hybridization with a
primer complementary to the Sp6 promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was then carried out according to
the method recommended for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied Biosystems 373A and 377
apparatus, according to the manufacturer's instructions.

[0068] The clone obtained contains a CL6-5' region encoding the N-terminal
end of integrase and a CL6-3' region corresponding to the 3' terminal region of
MSRV-1 and making it possible to define the end of the envelope (234 bp) and the U3
and R (401 bp) regions of the MSRV-1 retrovirus.

[0069] The region corresponding to the N-terminal end of integrase is
represented by its nucleotide sequence (SEQ ID NO: 4) in Figure 2. The three
potential reading frames are presented by their amino acid sequence under the
nucleotide sequence, and the amino acid sequence of the N-terminal end of integrase
is identified by SEQ ID NO: 5.

[0070] The CL6-3' region is represented by its nucleotide sequence (SEQ ID
NO: 6) in Figure 3. The three potential reading frames are presented by their amino
acid sequence under the nucleotide sequence. An amino acid sequence corresponding
to the C-terminal end of the MSRV-1 env protein is identified by SEQ ID NO: 7.

[0071] In order to evaluate the promoter activity of the LTR obtained from
clone 6 (cl6), a test of promoter activity using the enzyme CAT (chloramphenicol
acetyl transferase) was carried out with the corresponding U3R region. In parallel, a
clone containing the same U3R region of endogenous retroviral RNA expressed in
normal placenta (PH74) and a clone (5M6) obtained from DNA were tested. The
result presented in Figure 12 shows a very high promoter activity of the LTR derived
from MS plasma (cl6) and a much lower activity with the sequences of
non-MS endogenous origin.

[0072] EXAMPLE 2: PREPARATION OF THE C15 CLONE
CONTAINING THE REGION ENCODING A PORTION OF THE MSRV-1
RETROVIRUS ENVELOPE

[0073] An RT-PCR was carried out on the total RNA extracted from virions
concentrated by ultracentrifugation of a synoviocyte culture supernatant obtained
from an MS patient. The synthesis of cDNA was carried out with an oligo dT primer
and the reverse transcriptase "Expand™ RT" from Boehringer according to the
conditions recommended by the company. A PCR was carried out with the Expand™
Long Template PCR System (Boehringer) under the following conditions: 94°C 5 min
then 93°C 1 min, 60°C 1 min, 68°C 3 min over 40 cycles and 68°C for 8 min and with
a final reaction volume of 50 µl.

[0074] Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 2 

5' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3';

- 3' primer, identified by SEQ ID NO: 8

5' TGG GGT TCC ATT TGT AAG ACC ATC TGT AGC TT 3' 

[0075] A second so-called "seminested" PCR was carried out with a 5' 
primer situated inside the region already amplified. This second PCR was carried out 
under the same experimental conditions as those used for the first PCR (except that 30 
cycles were used instead of 40), using 10 µl of the amplification product derived from
the first PCR. 

[0076] Primers used for the seminested PCR: 

- 5' primer, identified by SEQ ID NO: 3 

5' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3';

- 3' primer, identified by SEQ ID NO: 8 

[0077] The primers SEQ ID NO: 2 and SEQ ID NO: 3 are specific for the
pol region of MSRV-1. The primer SEQ ID NO: 8 is specific for the sequence FBd13
(also called B13) and is located in the env region conserved among the
oncoretroviruses.

[0078] An amplification product of 1932 bp was obtained and cloned in the
following manner:

the amplified DNA was inserted into a plasmid with the aid of the TA
Cloning kit®. The various steps were carried out in accordance with the instructions
for the TA Cloning kit® (Invitrogen). At the end of the procedure, the white colonies
of recombinant bacteria were subcultured so as to be cultured and allow the
extraction of the plasmids incorporated according to the so-called "miniprep"
procedure. The plasmid preparation of each recombinant colony was cut with an
appropriate restriction enzyme and analyzed on agarose gel. The plasmids possessing
an insert detected under UV light after staining the gel with ethidium bromide were
selected for the sequencing of the insert after hybridization with a primer
complementary to the SP6 promoter present on the cloning plasmid of the TA Cloning
kit®. The reaction prior to the sequencing was then carried out according to the
method recommended for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied Biosystems 373A and 377
apparatus, according to the manufacturer's instructions.

[0079] The C15 clone obtained contains a region of 1481 bp corresponding to
the region of the MSRV-1 envelope.

[0080] The env region of the C15 clone is represented by its nucleotide
sequence (SEQ ID NO: 9) in Figure 4. The three potential reading frames of this
clone are presented by their amino acid sequence under the nucleotide sequence. The
reading frame corresponding to an MSRV-1 structural env protein is identified by
SEQ ID NO: 10.

[0081] From the defined sequences obtained from clones cl6 and C15, it was
possible to produce a plasmid construct encoding a complete envelope followed by
the 3' LTR, as presented in Figure 13 with the corresponding reading frame.

[0082] EXAMPLE 3: PREPARATION OF A 5M6 CLONE
CONTAINING THE SEQUENCES OF THE 3' TERMINAL REGION OF THE
ENVELOPE, FOLLOWED BY THE MSRV-1 PROVIRAL TYPE U3, R AND U5
SEQUENCES

[0083] A monodirectional PCR was carried out on the DNA extracted from
immortalized B lymphocytes in culture from an MS patient. The PCR was carried out
with the Expand™ Long Template PCR System (Boehringer) under the following
conditions: 94°C 3 min then 93°C 1 min, 60°C 1 min, 68°C 3 min over 10 cycles,
then 93°C 1 min, 60°C 1 min with 15 sec of extension at each cycle, 68°C 3 min over
35 cycles and 68°C for 7 min and with a final reaction volume of 50 µl.

[0084] The primer used for the PCR, identified by SEQ ID NO: 11, is 5' TCA
AAA TCG AAG AGC TTT AGA CTT GCT AAC CG 3'.

[0085] The primer of SEQ ID NO: 11 is specific for the env region of the
C15 clone.

[0086] An amplification product of 1673 bp was obtained and cloned in the
following manner:

the amplified DNA was inserted into a plasmid with the aid of the TA
Cloning kit®. The various steps were carried out in accordance with the instructions
for the TA Cloning kit® (Invitrogen). At the end of the procedure, the white colonies
of recombinant bacteria were subcultured so as to be cultured and allow the
extraction of the plasmids incorporated according to the so-called "miniprep"
procedure. The plasmid preparation of each recombinant colony was cut with an
appropriate restriction enzyme and analyzed on agarose gel. The plasmids possessing
an insert detected under UV light after staining the gel with ethidium bromide were
selected for the sequencing of the insert after hybridization with a primer
complementary to the T7 promoter present on the cloning plasmid of the TA Cloning
kit®. The reaction prior to the sequencing was then carried out according to the
method recommended for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied Biosystems 373A and 377
apparatus, according to the manufacturer's instructions.

[0087] The 5M6 clone obtained contains a region of 492 bp corresponding to the 3'
region of the MSRV-1 envelope, followed by the regions U3, R and U5
(837 bp) of MSRV-1.

[0088] The 5M6 clone is represented by its nucleotide sequence (SEQ ID
NO: 12) in Figure 5. The three potential reading frames of this clone are presented by
their amino acid sequence under the nucleotide sequence. The reading frame
corresponding to the C-terminal end of the MSRV-1 env protein is identified by SEQ
ID NO: 13.

[0089] EXAMPLE 4: PREPARATION OF THE LB16 CLONE
CONTAINING THE REGION ENCODING THE MSRV-1 RETROVIRUS
INTEGRASE

[0090] An RT-PCR was carried out on the total RNA treated with DNase I
and extracted from a choroid plexus obtained from an MS patient. The synthesis of
cDNA was carried out with an oligo dT primer and the reverse transcriptase
"Expand™ RT" from Boehringer according to the conditions recommended by the company.
A "no RT" control was carried out in parallel on the same material. A PCR was
carried out with Taq polymerase (Perkin Elmer) under the following conditions: 95°C
5 min, then 95°C 1 min, 55°C 1 min, 72°C 2 min over 35 cycles and 72°C for 8 min
and with a final reaction volume of 50 µl.

[0091] Primers used for the PCR:

- 5' primer, identified by SEQ ID NO: 14

5' GOG ATT GAT AGG AGG GAT GAG 3';

- 3' primer, identified by SEQ ID NO: 15

5' GAT GTG AGG AGG GTG GAA TAG 3'

[0092] The primer SEQ ID NO: 14 is specific for the pol region of MSRV-1
and more precisely similar to the integrase region described above. The primer SEQ
ID NO: 15 was defined on sequences of the clones obtained during preliminary tests.

[0093] An amplification product of about 760 bp was obtained only in the
test with RT and was cloned in the following manner:

the amplified DNA was inserted into a plasmid with the aid of the TA
Cloning kit®. The various steps were carried out in accordance with the instructions
for the TA Cloning kit® (Invitrogen). At the end of the procedure, the white colonies
of recombinant bacteria were subcultured so as to be cultured and allow the
extraction of the plasmids incorporated according to the so-called "miniprep"
procedure. The plasmid preparation of each recombinant colony was cut with an
appropriate restriction enzyme and analyzed on agarose gel. The plasmids possessing
an insert detected under UV light after staining the gel with ethidium bromide were
selected for the sequencing of the insert after hybridization with a primer
complementary to the T7 promoter present on the cloning plasmid of the TA Cloning
kit®. The reaction prior to the sequencing was then carried out according to the
method recommended for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied Biosystems 373A and 377
apparatus, according to the manufacturer's instructions.

[0094] The LB16 clone obtained contains the sequences corresponding to
integrase. The nucleotide sequence of this clone is identified by SEQ ID NO: 16 and
is represented in Figure 11, in which three reading frames are shown.

[0095] EXAMPLE 5: PREPARATION OF A CLONE 2, CL2,
CONTAINING IN 3' A PORTION HOMOLOGOUS TO THE POL GENE,
CORRESPONDING TO THE PROTEASE GENE, AND TO THE GAG GENE
(GM3) CORRESPONDING TO THE NUCLEOCAPSID, AND A NEW 5' CODING
REGION, CORRESPONDING TO THE GAG GENE, MORE SPECIFICALLY THE
TEMPLATE AND THE CAPSID OF MSRV-1

[0096] A PCR amplification was carried out on the total RNA extracted from
100 µl of plasma from a patient suffering from MS. A water control, treated under the
same conditions, was used as negative control. The synthesis of cDNA was carried
out with 300 pmol of a random primer (GIBCO-BRL, France) and the reverse
transcriptase "Expand RT" (BOEHRINGER MANNHEIM, France) according to the
conditions recommended by the company. An amplification by PCR ("polymerase
chain reaction") was carried out with the enzyme Taq polymerase (Perkin Elmer,
France) using 10 µl of cDNA under the following conditions: 94°C 2 min, 55°C
1 min, 72°C 2 min then 94°C 1 min, 55°C 1 min, 72°C 2 min over 30 cycles and 72°C
for 7 min with a final reaction volume of 50 µl.

[0097] Primers used for the PCR amplification:

- 5' primer, identified by SEQ ID NO: 17

5' CGG ACA TCC AAA GTG ATG GGA AAC G 3';

- 3' primer, identified by SEQ ID NO: 18

5' GGA CAG GAA AGT AAG ACT GAG AAG GC 3'

[0098] A second amplification by so-called "seminested" PCR was carried
out with a 5' primer situated inside the region already amplified. This second PCR
was carried out under the same experimental conditions as those used during the first
PCR, using 10 µl of the amplification product derived from the first PCR.

[0099] Primers used for the amplification by seminested PCR:

- 5' primer, identified by SEQ ID NO: 19

5' CCT AGA ACG TAT TCT GGA GAA TTG GG 3';

- 3' primer, identified by SEQ ID NO: 20

5' TGG CTC TCA ATG GTC AAA CAT ACC CG 3'

[0100] The primers SEQ ID NO: 18 and SEQ ID NO: 20 are specific for the
pol region, clone G+E+A, more specifically the E region: nucleotide position No. 423
to No. 448. The primers used in the 5' region were defined on sequences of clones
obtained during preliminary tests.

[0101] An amplification product of 1511 bp was obtained from the RNA
extracted from the plasma of an MS patient. The corresponding fragment was not
observed for the water control. This amplification product was cloned in the
following manner.

[0102] The amplified DNA was inserted into a plasmid with the aid of the
TA Cloning kit™. The 2 µl of DNA solution were mixed with 5 µl of sterile
distilled water, 1 µl of a 10 times concentrated ligation buffer "10X LIGATION
BUFFER", 2 µl of "pCR™ VECTOR" (25 ng/ml) and 1 µl of "T4 DNA LIGASE".
This mixture was incubated overnight at 14°C. The following steps were carried out
in accordance with the instructions of the TA Cloning kit® (Invitrogen). The mixture
was plated after transformation of the ligation into E. coli INVαF' bacteria. At the end
of the procedure, the white colonies of recombinant bacteria were subcultured so as to
be cultured and allow the extraction of the plasmids incorporated according to the
so-called "DNA minipreparation" procedure (17). The plasmid preparation of each
recombinant colony was cut with the restriction enzyme EcoRI and analyzed on
agarose gel. The plasmids possessing an insert detected under UV light after staining
the gel with ethidium bromide were selected for the sequencing of the insert after
hybridization with a primer complementary to the T7 promoter present on the cloning
plasmid of the TA Cloning kit®. The reaction prior to the sequencing was then carried
out according to the method recommended for using the sequencing kit "PRISM™
Ready Reaction AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems,
ref. 402119) and the automated sequencing was carried out on the Applied
Biosystems 373A and 377 apparatus, according to the manufacturer's instructions.

[0103] The clone obtained, called CL2, contains a C-terminal region similar
to the 5' terminal region of the clones G+E+A of MSRV-1, which makes it possible to
define the C-terminal region of the gag gene and a new region corresponding to the
N-terminal region of the MSRV-1 gag gene.

[0104] CL2 makes it possible to define a region of 1511 bp having an open
reading frame in the N-terminal region of 1077 bp encoding 359 amino acids and a
non-open reading frame of 454 bp corresponding to the C-terminal region of the
MSRV-1 gag gene.
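
The relation between the 1077 bp open reading frame and the 359 amino acids it encodes is simply the three-nucleotide codon length. A minimal Python sketch of that arithmetic follows; the function name is hypothetical, and whether the stated 1077 bp includes a stop codon is not specified in the text, so none is assumed.

```python
CODON_LENGTH = 3  # nucleotides per codon


def codon_count(orf_length_bp):
    """Number of whole codons in an ORF; rejects lengths that break the frame."""
    if orf_length_bp % CODON_LENGTH != 0:
        raise ValueError(f"{orf_length_bp} bp is not a multiple of {CODON_LENGTH}")
    return orf_length_bp // CODON_LENGTH


print(codon_count(1077))
```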

[0105] The nucleotide sequence of CL2 is identified by SEQ ID NO: 21. It 
is represented in Figure 6 with the potential reading frames in amino acids. 

[0106] The 1077 bp fragment of CL2 encoding 359 amino acids was
amplified by PCR with the Pwo enzyme (5 U/µl) (Boehringer Mannheim, France)
using 1 µl of the DNA minipreparation of clone 2 under the following conditions:
95°C 1 min, 60°C 1 min, 72°C 2 min over 25 cycles and with a final reaction volume
of 50 µl with the aid of the primers:

- 5' primer (BamHI), identified by SEQ ID NO: 23

5' TGC TGG AAT TCG GGA TCC TAG AAC GTA TTC 3' (30 mer), and

- 3' primer (HindIII), identified by SEQ ID NO: 24

5' AGT TCT GCT CCG AAG CTT AGG CAG ACT TTT 3' (30 mer) corresponding,
respectively, to the nucleotide sequence of clone 2 at position -9 to 21 and 1066 to
1095.
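
The two subcloning primers are labeled with the restriction enzymes whose sites they carry. As an illustrative check, the Python sketch below confirms that each primer, as printed above, is a 30-mer and contains the standard recognition sequence of its enzyme (GGATCC for BamHI, AAGCTT for HindIII). The helper and dictionary names are hypothetical.

```python
# Primer sequences as printed in the text; spaces are removed programmatically.
PRIMERS = {
    "SEQ ID NO: 23": ("TGC TGG AAT TCG GGA TCC TAG AAC GTA TTC", "BamHI"),
    "SEQ ID NO: 24": ("AGT TCT GCT CCG AAG CTT AGG CAG ACT TTT", "HindIII"),
}

# Standard recognition sequences of the two enzymes.
RECOGNITION_SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}


def check_primer(spaced_sequence, enzyme):
    """Return (length, site_present) for a primer written in spaced triplets."""
    sequence = spaced_sequence.replace(" ", "").upper()
    return len(sequence), RECOGNITION_SITES[enzyme] in sequence


for name, (spaced, enzyme) in PRIMERS.items():
    length, present = check_primer(spaced, enzyme)
    print(f"{name}: {length} nt, {enzyme} site present: {present}")
```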

[0107] The fragment obtained by PCR was digested with BamHI and
HindIII and subcloned into the expression vectors pET28C and pET21C
(NOVAGEN) linearized with BamHI and HindIII. The sequencing of the DNA of the
1077 bp fragment of clone 2 in the two expression vectors was carried out according
to the method recommended for the use of the sequencing kit "PRISM™ Ready
Reaction AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems, ref. 402119)
and the automated sequencing was carried out on the Applied Biosystems 373A and
377 apparatus, according to the manufacturer's instructions.

[0108] The sequences expressed from the 1077 bp fragment of
clone 2 by the expression vectors pET28C and pET21C are identified by SEQ ID
NO: 25 and SEQ ID NO: 26, respectively.

[0109] EXAMPLE 6: EXPRESSION OF CLONE 2 IN ESCHERICHIA COLI

[0110] The constructs pET28C-clone 2 (1077 bp) and pET21C-clone 2 (1077
bp) synthesize, in the bacterial strain BL21 (DE3), a protein fused with 6
histidines at the N- and C-terminus for the vector pET28C and at the C-terminus for
the vector pET21C, having an apparent molecular mass of about 45 kDa, identified by
SDS-PAGE polyacrylamide gel electrophoresis (SDS = sodium dodecyl sulfate)
(Laemmli, 1970 (1)). The reactivity of the protein was demonstrated towards an
anti-histidine monoclonal antibody (DIANOVA) by the Western blot technique
(Towbin et al., 1979 (2)).

[0111] The recombinant proteins pET28C-clone 2 (1077 bp) and
pET21C-clone 2 (1077 bp) were visualized by SDS-PAGE in the insoluble fraction
after enzymatic digestion of the bacterial extracts with 50 µl of lysozyme (10 mg/ml)
and ultrasound lysis.

[0112] The antigenic properties of the recombinant antigens pET28C-clone 2
(1077 bp) and pET21C-clone 2 (1077 bp) were tested by Western blot after
solubilization of the bacterial pellet with 2% SDS and 50 mM β-mercaptoethanol.
After incubation with sera from patients suffering from multiple sclerosis, the sera
from neurological controls and the sera from controls at the Blood Transfusion Center
(CTS), the immunocomplexes were detected with the aid of an alkaline
phosphatase-coupled goat serum anti-human IgG and anti-human IgM.

[0113] The results are presented in the table below.

TABLE

Reactivity of sera affected by multiple sclerosis and controls with the MSRV-1
recombinant protein gag clone 2 (1077 bp) = pET21C-clone 2 (1077 bp) and
pET28C-clone 2 (1077 bp) (a)

DISEASE                   NUMBER OF             NUMBER OF POSITIVE
                          INDIVIDUALS TESTED    INDIVIDUALS

MS                        15                    6: 2 (+++), 2 (++), 2 (+)
NEUROLOGICAL CONTROLS     2                     1 (+++)
HEALTHY CONTROLS (CTS)    22                    1 (+/-)

(a) The strips containing 1.5 µg of recombinant antigen pET-gag clone 2
(1077 bp) exhibit reactivity against sera diluted 1/100. The Western blot
interpretation is based on the presence or absence of a specific pET-gag clone 2
(1077 bp) band on the strips. Positive and negative controls are included in each
experiment.

[0114] These results show that, under the technical conditions used, about
40% of the human sera affected by multiple sclerosis which were tested react with the
recombinant proteins pET28C-clone 2 (1077 bp) and pET21C-clone 2 (1077 bp).
Reactivity was observed on a neurological control and it is of interest to note that the
RNAs extracted from this serum, after the reverse transcriptase step, are also
amplified by PCR in the pol region. This suggests that people who have not declared
MS may also harbor and express this virus. On the other hand, an apparently healthy
control (CTS donor) possesses anti-gag (clone 2, 1077 bp) antibodies. This is
compatible with an immunity acquired against MSRV-1 independently of a declared
associated autoimmune disease.
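
The "about 40%" figure follows directly from the counts in the table (6 positive sera out of 15 MS sera tested). The Python sketch below reproduces that proportion for each group; the dictionary and function names are hypothetical, and with groups this small (notably the 2 neurological controls) the percentages are descriptive only.

```python
# (number tested, number positive) per group, as given in the table.
RESULTS = {
    "MS": (15, 6),
    "Neurological controls": (2, 1),
    "Healthy controls (CTS)": (22, 1),
}


def positive_rate(tested, positive):
    """Percentage of tested individuals scored positive."""
    return 100.0 * positive / tested


for group, (tested, positive) in RESULTS.items():
    print(f"{group}: {positive}/{tested} = {positive_rate(tested, positive):.1f}%")
```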

[0115] EXAMPLE 7: PREPARATION OF AN LB13 CLONE
CONTAINING IN 3' A PORTION HOMOLOGOUS TO CLONE 2
CORRESPONDING TO THE GAG GENE AND IN 5' A PORTION
HOMOLOGOUS TO THE 5M6 CLONE CORRESPONDING TO THE U5 LTR
REGION

[0116] An RT-PCR ("reverse transcriptase-polymerase chain reaction") was
carried out using total RNA extracted from virions, obtained from supernatants of
B lymphocyte cells of patients suffering from multiple sclerosis, concentrated by
ultracentrifugation. The synthesis of cDNA was carried out with a specific primer
SEQ ID NO: 27 and the reverse transcriptase "Expand™ RT" from BOEHRINGER
MANNHEIM according to the conditions recommended by the company.

[0117] Primer used for the synthesis of the cDNA, identified by SEQ ID 
NO: 27: 

5' CTT GGA GGG TGC ATA ACC AGG GAA T 3' 

[0118] A PCR amplification was carried out with Taq polymerase (Perkin
Elmer, France) under the following conditions: 94°C 1 min, 55°C 1 min, 72°C 2 min
over 35 cycles and 72°C for 7 min and with a final reaction volume of 100 µl.

[0119] Primers used for the PCR amplification: 

- 5' primer, identified by SEQ ID NO: 28 
5' TGT CCG CTG TGC TCC TGA TC 3'

- 3' primer, identified by SEQ ID NO: 27

5' CTT GGA GGG TGC ATA ACC AGG GAA T 3'

[0120] A second so-called "seminested" PCR amplification was carried out
with a 3' primer situated inside the region already amplified. This second
amplification was carried out under the same experimental conditions as those used
during the first amplification, using 10 µl of the amplification product derived from
the first PCR.

[0121] Primers used for the "seminested" PCR amplification:

- 5' primer, identified by SEQ ID NO: 28

5' TGT CCG CTG TGC TCC TGA TC 3'

- 3' primer, identified by SEQ ID NO: 29

5' GTA TGT GGT TTT GGA GTG TTT GGG T 3'

[0122] The primers SEQ ID NO: 27 and SEQ ID NO: 29 are specific for the 
gag region, clone 2 nucleotide position No. 373-397 and No. 433-456. The primers 
used in the 5' region were defined on sequences of the clones obtained during 
preliminary tests. 

[0123] An amplification product of 764 bp was obtained and cloned in the
following manner:

[0124] The amplified DNA was inserted into a plasmid with the aid of the
TA Cloning kit™. The 2 µl of DNA solution were mixed with 5 µl of sterile distilled
water, 1 µl of a 10 times concentrated ligation buffer "10X LIGATION BUFFER",
2 µl of "pCR™ VECTOR" (25 ng/ml) and 1 µl of "T4 DNA LIGASE". This mixture
was incubated overnight at 14°C. The following steps were carried out in accordance
with the instructions of the TA Cloning kit® (Invitrogen). The mixture was plated
after transformation of the ligation into E. coli INVαF' bacteria. At the end of the
procedure, the white colonies of recombinant bacteria were subcultured so as to be
cultured and allow the extraction of the plasmids incorporated according to the
so-called "DNA minipreparation" procedure (17). The plasmid preparation of each
recombinant colony was cut with the restriction enzyme EcoRI and analyzed on
agarose gel. The plasmids possessing an insert detected under UV light after staining
the gel with ethidium bromide were selected for the sequencing of the insert after
hybridization with a primer complementary to the T7 promoter present on the cloning
plasmid of the TA Cloning kit®. The reaction prior to the sequencing was then carried
out according to the method recommended for using the sequencing kit "PRISM™
Ready Reaction AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied Biosystems,
ref. 402119) and the automated sequencing was carried out on the Applied
Biosystems 373A and 377 apparatus, according to the manufacturer's instructions.

[0125] The LB13 clone obtained contains an N-terminal region of the MSRV-1
gag gene homologous to clone 2 and an LTR corresponding to a portion of the U5
region. Between the U5 region and gag, a binding site for the transfer RNAs, the PBS
("primer binding site"), was identified.

[0126] The nucleotide sequence of the 764 bp fragment of the LB13 clone in
the plasmid "pCR™ vector" is identified by SEQ ID NO: 30.

[0127] The binding site for the transfer RNAs, having a sequence of PBS
tryptophan type, was identified at nucleotide position No. 342-359 of the LB13 clone.
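
Nucleotide positions such as 342-359 are 1-based and inclusive, so the PBS spans 18 nucleotides. A small Python sketch of how such coordinates map onto a zero-based string slice is given below; the helper name is hypothetical, and a placeholder string of the stated 764 bp length stands in for the actual LB13 sequence, which is not reproduced here.

```python
def slice_1based(sequence, start, end):
    """Return the subsequence for 1-based, inclusive coordinates start..end."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Placeholder of the stated fragment length; NOT the real LB13 sequence.
placeholder_lb13 = "N" * 764

pbs = slice_1based(placeholder_lb13, 342, 359)
print(len(pbs))
```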

[0128] As this same PBS was found in the endogenous copies homologous
to MSRV-1, the endogenous family thus defined is henceforth called HERV-W,
according to the nomenclature proposed for the endogenous retrovirus families
(W = tryptophan).

[0129] A short ORF of about 65 amino acids was found in the U5 region of
the 5' LTR of the LB13 clone.

[0130] Sequence of the ORF (SEQ ID NO: 32): 
PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWE 
LWEQGPQVTIW. 
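
As a simple consistency check on the stated length of this ORF, the Python sketch below counts the residues in the sequence exactly as printed above, joined across the line break. The variable names are hypothetical, and the check says nothing about whether the printed sequence matches the sequence listing.

```python
# ORF sequence as printed in the text, in two pieces joined across the line break.
ORF_PRINTED = (
    "PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWE"
    "LWEQGPQVTIW"
)

# The 20 standard amino acid one-letter codes.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")

residue_count = len(ORF_PRINTED)
only_standard = set(ORF_PRINTED) <= STANDARD_RESIDUES
print(residue_count, only_standard)
```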

[0131] The corresponding nucleotide sequence starting at an ATG codon is
capable of being expressed in a subgenomic DNA from a proviral LTR (U3RU5).

[0132] Another clone, called LA15, was obtained on the total RNA extracted
from virions concentrated by ultracentrifugation from a culture supernatant of
synoviocytes obtained from a patient suffering from rheumatoid arthritis. The strategy
for amplifying and cloning the LA15 clone is exactly the same as that used for the
LB13 clone.

[0133] The nucleotide sequence of the LA15 clone, which is identified by
SEQ ID NO: 31, is very similar to that of the LB13 clone. This suggests that
the MSRV-1 retrovirus detected in multiple sclerosis has sequences which are similar
to those found in rheumatoid arthritis.

[0134] EXAMPLE 8: RECONSTRUCTION OF AN RU5-GAG REGION
FROM THE CLONES LB15, LB13, CL2 AND CL17

[0135] The clones CL2 and LB13 have already been described in the
preceding examples. The LB15 clone was obtained using the R sequence of the LTR
of the cl6 clone in order to define a primer in 5' and the antisense primers used are the
same as for the LB13 clone. The CL17 clone was obtained by nested RT-PCR using
the following primers:

5'-TCATGCAACTGCACTCTTCTGGTCCG-3' (SEQ ID NO: 33) (sense) 
5'-TCTTGCACTAACCTCCACTGTCCGTTGG-3' (SEQ ID NO: 34) 
(antisense) 

5'-ATCCCCCAGTAACAATTTGGTGACCACG-3' (SEQ ID NO: 35) (sense) 
5'-TCGGGTCTAAGAGGGTACTTCCTTTGGTAGG-3' (SEQ ID NO: 36) 
(antisense) 
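
For reference, basic properties of the four primers listed above (length, GC content, and the reverse complement of each antisense primer) can be tabulated with a short Python sketch. The sketch is illustrative only: the helper names are hypothetical, and no assumption is made as to which pair is used in the first or the second round of the nested RT-PCR, since this is not stated above.

```python
# Primer sequences copied from the listing above (written 5' to 3').
primers = {
    "SEQ ID NO: 33 (sense)": "TCATGCAACTGCACTCTTCTGGTCCG",
    "SEQ ID NO: 34 (antisense)": "TCTTGCACTAACCTCCACTGTCCGTTGG",
    "SEQ ID NO: 35 (sense)": "ATCCCCCAGTAACAATTTGGTGACCACG",
    "SEQ ID NO: 36 (antisense)": "TCGGGTCTAAGAGGGTACTTCCTTTGGTAGG",
}

# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


for name, seq in primers.items():
    line = f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}"
    if "antisense" in name:
        # Sequence of the sense strand to which the antisense primer corresponds.
        line += f", sense-strand site 5'-{reverse_complement(seq)}-3'"
    print(line)
```

The reverse complement of an antisense primer gives the sequence expected on the sense strand of the target, which is convenient when locating the primer in a sequence such as the reconstructed RU5-gag region.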

[0136] The LB15 clone was obtained from virions obtained by culturing MS 
cells. The CL17 clone was obtained from culturing plasma from an MS patient. 

[0137] These overlapping clones made it possible to reconstruct an RU5-gag 
sequence with a potential ORF in the gag gene, as presented in Figure 14. 

[0138] EXAMPLE 9: PREPARATION OF A CLONE 87-23 

[0139] The region corresponding to integrase was amplified and cloned from 
MS plasma using a seminested RT-PCR with the following primers situated in the pol 
and env regions of MSRV1. 

[0140] In the pol region: 

5'-TTACGCAGGTCTCAGGGATGAGCTT-3' (SEQ ID NO: 37) 
(sense, primary PCR) 

5'-CGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTA-3' (SEQ ID 
NO: 38) (sense, secondary PCR) 

[0141] In the env region: 

5'-GGTACGGAGGGTTTCATGTAGTTTTGAG-3' (SEQ ID NO: 39) 
(antisense, primary and secondary PCR) 
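
The seminested arrangement described above, in which a single antisense primer serves in both rounds while the sense primer changes, can be summarized in a short Python sketch. The sketch is illustrative only: the dictionary layout and the function name are hypothetical, and the round assignments simply restate the annotations given with each primer.

```python
# Primer sequences copied from the listing above (written 5' to 3').
PRIMERS = {
    "SEQ ID NO: 37": "TTACGCAGGTCTCAGGGATGAGCTT",          # pol, sense
    "SEQ ID NO: 38": "CGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTA",  # pol, sense
    "SEQ ID NO: 39": "GGTACGGAGGGTTTCATGTAGTTTTGAG",       # env, antisense
}

# (sense primer, antisense primer) used in each amplification round.
ROUNDS = {
    "primary": ("SEQ ID NO: 37", "SEQ ID NO: 39"),
    "secondary": ("SEQ ID NO: 38", "SEQ ID NO: 39"),
}


def shared_primers(rounds: dict) -> set:
    """Return the names of the primers that are used in every round."""
    return set.intersection(*(set(pair) for pair in rounds.values()))


for round_name, (sense, antisense) in ROUNDS.items():
    print(
        f"{round_name}: {sense} ({len(PRIMERS[sense])} nt) + "
        f"{antisense} ({len(PRIMERS[antisense])} nt)"
    )

# In a seminested scheme exactly one primer is common to both rounds.
print("shared between rounds:", sorted(shared_primers(ROUNDS)))
```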

[0142] The amplified clone contains 774 bp in the pol/RT region, the entire 
integrase region (1197 bp) and the start of the env region (480 bp). The nucleotide 
sequence corresponding to the integrase region and the translation to amino acids of 
the potential ORF are presented in Figure 15. 
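
The segment lengths given above can be combined in a short Python sketch to estimate the overall size of the cloned fragment and the number of codons in the integrase region. This is illustrative only and assumes that the three segments are contiguous and non-overlapping, which is not stated explicitly above; the variable names are hypothetical.

```python
# Segment lengths in base pairs, as stated for the amplified clone.
segments_bp = {"pol/RT": 774, "integrase": 1197, "env (start)": 480}

# Total length, assuming contiguous and non-overlapping segments.
total_bp = sum(segments_bp.values())

# Number of complete codons in the integrase region and leftover bases.
integrase_codons, remainder = divmod(segments_bp["integrase"], 3)

print(f"total: {total_bp} bp")
print(f"integrase: {integrase_codons} codons, {remainder} bp left over")
```

Under that assumption the fragment is 2451 bp long, and the 1197 bp integrase region corresponds to exactly 399 codons.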

[0143] EXAMPLE 10: CONFIRMATION OF THE PRESENCE OF RNA 
CONTAINING ENV SEQUENCES RELATED TO ERV9 IN THE RETROVIRAL 
PARTICLES ASSOCIATED WITH THE MSRV1 GENOME 

[0144] Sequences related to ERV9 have been found in the virion preparations 
obtained from MS, in a minor proportion compared with the MSRV1 sequences. The 
co-encapsidation of phylogenetically related endogenous sequences into retroviral 
particles produced by a replicative strain has been described. Surprisingly, an 
RNA region comprising an ORF starting in the 3' portion of env and potentially 
continuing into the 3' LTR has been found in various MS samples. To confirm the 
existence of an ORF, transcription-translation tests were carried out; they 
demonstrated the existence of an env ORF containing the entire transmembrane (TM) 
portion and ending at the start of the putative LTR. However, an additional frame 
(ORFX) follows and continues in the 3' LTR. The two products of expression were 
visualized and their respective ORFs were subcloned. Figure 16 represents the 
nucleotide and peptide sequences of the B13 clone already described, specifying 
the ORFs in the truncated env region and in the putative LTR. The presence of such 
RNAs may be responsible for recombinations with the replicative strain and may 
consequently generate strains having a modified pathogenicity. 